-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Feature/large pipeline #2706
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature/large pipeline #2706
Conversation
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
src/sagemaker/workflow/pipeline.py
Outdated
@@ -102,37 +105,65 @@ def create( | |||
description (str): A description of the pipeline. | |||
tags (List[Dict[str, str]]): A list of {"Key": "string", "Value": "string"} dicts as | |||
tags. | |||
parallelism_config (Optional[Config for parallel steps, Parallelism configuration that |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(Optional[ParallelismConfiguration]): parallelism configuration that applied to each of the executions of the pipeline.
Can you update the doc string in other places as well.
src/sagemaker/workflow/pipeline.py
Outdated
@@ -223,6 +265,8 @@ def start( | |||
pipeline parameters. | |||
execution_display_name (str): The display name of the pipeline execution. | |||
execution_description (str): A description of the execution. | |||
parallelism_config (Optional[ParallelismConfiguration]): Config for parallel steps, that |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(Optional[ParallelismConfiguration]): parallelism configuration that applied to the newly started execution. It takes precedence over the parallelism configuration of the parent pipeline.
tests/integ/test_workflow.py
Outdated
except Exception: | ||
pass | ||
|
||
def test_create_parallelism_config(sagemaker_session, role, pipeline_name, region_name): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
test_create_and_update_with_parallelism_config
try: | ||
response = pipeline.create(role, parallelism_config={"MaxParallelExecutionSteps": 50}) | ||
create_arn = response["PipelineArn"] | ||
assert re.match( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also do a describe call to verify the configuration
pipeline.parameters = [ParameterInteger(name="InstanceCount", default_value=1)] | ||
response = pipeline.update(role, parallelism_config={"MaxParallelExecutionSteps": 50}) | ||
update_arn = response["PipelineArn"] | ||
assert re.match( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also do a describe call to verify the configuration
pipeline.parameters = [ParameterInteger(name="InstanceCount", default_value=1)] | ||
response = pipeline.update(role) | ||
update_arn = response["PipelineArn"] | ||
assert re.match( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do a describe call to verify the configuration
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please fix the unit test black-check errors.
Fixed formatting error |
Codecov Report
@@ Coverage Diff @@
## master #2706 +/- ##
=========================================
Coverage ? 88.80%
=========================================
Files ? 168
Lines ? 14724
Branches ? 0
=========================================
Hits ? 13076
Misses ? 1648
Partials ? 0
Continue to review full report at Codecov.
|
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
Co-authored-by: Basil Beirouti <[email protected]>
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
Please fix conflicts and address feedback to make this eligible for review again |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
closing this. Please reopen when working on it.
Changes merged in: #2825 |
Issue #, if available:
Add support for large pipeline definition and step parallelism configuration
Description of changes:
Updated the create/update pipeline to upload large pipeline definition (greater than 100Kb) to S3 bucket and replaces
PipelineDefinition
withPipelineDefinitionS3Location
Testing done:
Merge Checklist
Put an
x
in the boxes that apply. You can also fill these out after creating the PR. If you're unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of what we are going to look for before merging your pull request.General
Tests
unique_name_from_base
to create resource names in integ tests (if appropriate)By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.